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RSSE OPTIMIZATION USING HARDWARE ACCELERATION 



CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] Not Applicable 

STATEMENT OF FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 
[0002] Not Applicable 
5 BACKGROUND OF THE INVENTION 

1. TECHNICAL FIELD 

[0003] This invention relates in general to communications and, more 
particularly, to a method and apparatus for optimizing RSSE calculations. 

2. DESCRIPTION OF THE RELATED ART 

10 [0004] Compared to binary modulation commonly used in 2G standards, 
advanced wireless communication systems may use high-level modulation 
schemes. From a system point of view, this results in spectral efficiency at the 
expense of an increase of Inter-Symbol Interference (ISI). 

[0005] In order to mitigate ISI, efficient equalizers need to be implemented on 
15 the receiver side. The optimal solution to this problem is known as Maximum 
Likelihood Sequence Estimation (MLSE). The goal of this method is to subtract 
every possible sequence of L constellation points weighted by the channel 
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estimation factors from the received symbol vector in order to determine the 
most probable symbols. L corresponds to the channel estimation length. 

[0006] The concept behind MLSE is to choose the minimum error square 
between the received symbol yk and every possible linear combination of the last 
5 L symbols h-i weighted by the channel response samples hi, all values being 
complex. In a mathematical form, the approximated symbol frame of length 
Lframe is given by: 

^ frame ~1 L~\ 

a = arg{min- £ (II ^ -£ fc A-/(*) II ^ » 

*=0 ;-0 

[0007] This inner term of the equation, the branch metric, can be rewritten as: 
10 ly.-Yhfl^-Ka^f 

[0008] The branch metric (brri) is composed by three terms. The first one is the 
soft value of the received symbol, the second one is the estimated contribution of 
the ISI coming from the L-1 preceding symbols and the third one. is the supposed 
received symbol multiplied by a channel coefficient. 

15 [0009] The a kM (s) for / = 1,..., L - 1 fully determine a state of the MLSE in a 

trellis representation. Thus, the number of possible combinations or states, taking 
into account that a k _ t (s) belongs to a set of M symbols and that K>L-1 with the 

channel response being of length L, is equal to M K . Furthermore, M transitions 
output each state. This makes MLSE too complicated for implementation in 
20 terminals receivers. 

[0010] A more adapted method, the Reduced State Sequence Equalizer (RSSE) 
is derived from MLSE. The simplification lies in the reduction of the number of 
states. Like for the MLSE, the RSSE method consists in minimizing a sum of 
square values (branch metrics) among all the symbols of the frame under 
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consideration. Therefore, a trellis structure is well adapted to carry out this task. 
Compared to convolutional decoding, there are, for the general case, no 
redundancy properties among the branch metrics. 

[0011] For each symbol received, i.e. on trellis transition, the following units 
5 are executed: (1) Branch Metric Computation (BMC) Unit: Compute individually 
one branch metric per transition, (2) Add Compare and Select (ACS) Unit: With 
more than two transitions per state and (3) Post-Processing. Current 
implementations do not use generic structure to perform the BMC and ACS 
functions. Each branch metric is computed, added to the previous state value 
10 and the minimum path is selected for all next states. 

[0012] While RSSE requires only a fraction of calculations required by MLSE, 
it still is computation intensive, using significant processing resources. 
Therefore, a need has arisen for a method and apparatus for performing RSSE 
computations efficiently. 
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BRIEF SUMMARY OF THE INVENTION 

[0013J In the present invention, a sequence of probable symbols is generated 
from a sequence of received symbols using Reduced State Sequence Estimation. 
Butterfly circuitry computes terms sm ] '= min{sm ] + m,sm 2 - m) and 
5 sm 2 ' == mm{sm ] - m, sm 2 + m) . Path metrics are computed between a first state and 
a second state responsive to the received symbols and reference constellation 
symbols and a best scenario path is determined at the second state using said 
butterfly circuitry. 

[0014] The present invention allows branch metric and the accumulate, 
10 compare and select (ACS) functions to be aided by use of a butterfly hardware 
accelerator available to support Viterbi decoding. 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

[0015] For a more complete understanding of the present invention, and the 
advantages thereof, reference is now made to the following descriptions taken in 
conjunction with the accompanying drawings, in which: 

5 [0016] Figures la-d illustrate four different possible cosets for 8PSK 
modulation; 

[0017] Figure 2 illustrates a trellis structure for an 8PSK 2-state RSSE and its 
decomposed version for butterfly execution; 

[0018] Figure 3 illustrates a butterfly structure; 

10 [0019] Figure 4 illustrates a block diagram showing the operation of an RSSE 
equalizer improved by hardware acceleration; 

[0020] Figure 5 illustrates how a Butterfly structure can be used to compute 
branch metrics; 

[0021] Figure 6 illustrates a n/ 8 counterclockwise rotation of a 8PSK 
15 constellation to produce symbols which can be accelerated by a Butterfly 
structure; 

[0022] Figure 7 illustrates how 2 Butterfly structures can be used to compute 
branch metrics relative to all constellation symbols of the rotated 8-PSK 
constellation; and 

20 [0023] Figure 8 illustrates a block diagram of a mobile communication device 
using a DSP with the accelerated RSSE. 



5 



TI-33782 



PATENT APPLICATION 



DETAILED DESCRIPTION OF THE INVENTION 

[0024] The present invention is best understood in relation to Figures 1-8 of 
the drawings, like numerals being used for like elements of the various 
drawings. 

5 [0025] The Reduced State Sequence Equalizer (RSSE) is derived from the 

MLSE. In its most complicated form, it is equivalent to the MLSE and in its most 
simplified form to the DFE (Decision Feedback Equalization) . Thus, its 
complexity is comprised between the MLSE complexity and the DFE complexity. 

[0026] The RSSE principle restricts the number of possible combination of 
10 past symbols to be tested by regrouping them inside so-called cosets. A coset is a 
collection of one or more symbols of the signal set. All symbols of the 
constellation need to be in one and only one coset. The 1 th coset at time n-k is 
represented by Ai(k). Jk-i is the number of cosets at time n-k and is naturally 
comprised between 1 and the number of symbols of the signal set: 1 < J k < M f or 

15 \fk . Furthermore, the analysis here below is restricted to values of Jk equal to 
power of 2, e.g. for a 8-PSK constellation we have J k e [1,2,4,8], and decreasing or 
equal values of Jk for increasing values of k: J k < J k _ x . This last restriction means 
that the older the symbol of the ISI sum, the less cosets at that time. 

[0027] Figures la-d illustrates four different possible cosets for 8-PSK 
20 modulation. As stated above, the idea behind the RSSE is to reduce the number 
of states of the MLSE. Thus, while a MLSE implementation would have M K 

states, the RSSE implementation would have Y\j k states, where l < Y\ J n ^ mK > 

k=0 k=6 

while both the MLSE and RSSE implementation have M branch metrics 
outputting every state. In the RSSE, the number of states is given by Yl^k I R 

k 

25 the MLSE, Jk is always equal to the number M of symbols in the constellation. 
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[0028] The RSSE selects the best symbol sequence by minimizing the sum of 
each Square Euclidean Distance of the estimated ISI term subtracted from the 
received samples. At each transition, each previous state issues branch metrics 
related to every symbol a 1 belonging to the reference constellation that reach the 
next states where the symbol is contained in the coset Aj. 

[0029] Figure 2 illustrates a trellis structure for an 8PSK 2-state RSSE. One 
sees that in Figure 2, the number of states is smaller than the number of branches 
per state and thus parallel transitions occur (four per state transition). 



[0030] The branch metric between states is b m (s, a k ) HI y k - £ h t a k _ f (s) - f(s) \\ 2 , 



L-l 

10 where f(s) may include any corrective terms to compensate for the reduced 
symbol set. 

[0031] Many digital signal processors include one or more hardware 
accelerators (HWAs). A possible HW A is the butterfly HW A for trellis 
calculations in Viterbi decoding. The butterfly structure is shown functionally in 
15 Figure 3. The butterfly structure computes: 
sm x % - mm{sm x +m,sm 2 - m) and 
sm 2 ' = minium, - m, sm 2 + m) 

[0032] Hardware accelerators can be of a number of different varieties. In 
some cases, the HWA can be implemented as a specific instruction that is an 
20 extension of the ISA (Instruction Set Architecture) which executes a complex 
function. The hardware that supports the instruction may be part of the 
processing core or coupled to the processing core. There may also be variations 
on the computations shown above. 

[0033] For purposes of discussion, it will be assumed that the butterfly 
25 structure has the following restrictions: (1) the branch metrics of a butterfly are 
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symmetric, (2) the branch metrics entering a state are of opposite sign and (3) the 
branch metrics are real values. Thus, the branch metric has to be a linear 
function for at least the part containing the parameter corresponding to the bm 
(the symbol for the equalizer). If not, the structure behind the HWA may not be 
5 fully used. 

[0034] Figure 4 illustrates a block diagram 20 showing the operation of the 
RSSE equalizer, indicating areas, which can be improved by the butterfly 
structure hardware. In block 22, parameters and trellis metrics initialization is 
performed. In block 24, the branch metrics are computed, using the benefit of 

10 acceleration from the butterfly structure. These operations are described in 
greater detail below. In block 26, the compare and select at the next states are 
performed, again with the benefit of the butterfly structure. In block 28, the 
different parameters are updated for computation of the following symbol. The 
loop of blocks 24 through 28 is repeated for a frame length. Upon completion of 

15 a frame, the traceback occurs in block 30 to determine the most likely sequence of 
symbols. 

[0035] The following describes modification of the general RSSE methods to 
allow use of the butterfly structure. As described above: 

bm{sA)-\\y k '%Ka k ^s)-m\\ 2 

1=0 

20 The equation can be rewritten as: 

_ M*AHI>>*-I^ with 
S^rB^W-ZW]- Thus ' 

bm(s,a k )=\\S-h 0 a k \\ 2 =\\S)\ 2 +|| h 0 a k f -2Re{h 0 a k S*}. Without loss of 
generality, we can set the coefficient ho=l and for 8PSK constellation \\a k \\ 2 is 
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equal to 1. Consequently, 

bm(s,a k )=\\S\\ 2 -2 Re[V3,S*]. 

[0036] For a given state, the || S \\ 2 is constant. Differences come from the 
targeted symbol a k . The branch metric can be decomposed further by inserting 
5 the real and imaginary parts of the right terms: a k = a x + j-a y , h 0 =h 1! + j ■ h y and 
S = S X + j-S y . Thus: 

bm(s,a k )=\\S\\ 2 -2[a x (h x S x +h y S y ) + a y (h x S y -h y S x )],ormore 

concisely: 

bm(s 9 a k )=\\S\\ 2 ~2[a x Re{h; S} + a y lm{h 0 * S}] 

10 [0037] Taking into account that h and S are related to the initiating state and 
not to the symbol, the equation can be rewritten as: 

bm(s, a k ) HI S || 2 -2[a x (s) + a y (s)] =||. S \\ 2 . -2a x (s) - 2a y (s) , with 

a x (s) = a x Re {h 0 * S} and a y (s) - a y lm{h 0 * S] . 

[0038] Examining this expression, it can be seen that the branch metric is 
15 composed by three terms, the first one is equal for every branch coming out of 
one state, the second and third are linear combinations of ho, the S term (both 
determined by the state) and respectively the real and the imaginary parts of the 
symbol under consideration. 

[0039] All the terms depend on the initiating state s. This excludes any use of 
20 a butterfly structure with two different states (since the butterfly structure has a 
m, -m,-m, m symmetry relation). Thus, the only solution is to use one butterfly to 
compute four transitions issued from one state. This would represent two 
butterflies per RSSE initiating state for an 8-PSK Constellation. 
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[0040] The RSSE applied to the 8PSK constellation, as defined in Figure la-d, 
has an interesting property: each coset with more than one symbol always has a 
symbol and its opposite. This is easily understandable by looking at Figures la-d. 
Furthermore, the 4 diagonal symbols (the ones in the coset Ai(k) in Figure lc) 
5 have coordinates equal to (±u,±u) where u=cos(7i/4). As shown above, the branch 
metric may be decomposed in three terms with the two last terms being a linear 
function of the real and imaginary parts of the symbol under consideration. This 
second property allows use of the butterfly by conceptually changing the notion 
of bm for the equalizer. The constant part ||iS| 2 of the RSSE bm and the positive or 
10 negative version of 2a x (s) are added to the old state value in order to define two 
Trellis states. The branch metric is equal to 2a y (s). 

[0041] As shown graphically in Figure 5, the Butterfly structure can be used 
to compute branch metrics by setting the butterfly HWA values as follows: 

smi = snioid + |S||i-2a*(s); 
15 smi = snioid + ||s|| 2 +2a x (s) 

m = 2a y (s) 

[0042] The butterfly structure computes the transition corresponding to four 
diagonal symbols of the 8PSK constellation by adding and subtracting their real 
20 and imaginary components. The minimum of the two branch metrics determines 
the best symbol for each coset. It should be noted however, that in this form the 
butterfly structure is applicable only to cosets that have the symbols with 
coordinates (±a x , ±fly), a x and % being two real numbers. 

[0043] Also as shown in Figure 5, in this form, only the two diagonal cosets 
25 can be computed by the butterfly structure. 
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[0044] A way to use the butterfly structure for the computation of all 
minimums of two value cosets in 8-PSK constellations is to operate a change of 
referential. The role of the equalizer is to produce hard decisions. A change of 
referential will therefore not affect the other parts of the DBB (digital baseband). 
5 A tc/8 counterclockwise rotation of the constellation shows interesting properties 
concerning the 8-PSK constellation symbols. As shown in Figure 6, a tt/8 
counterclockwise rotation produces symbols that are determined by only two 
real values a and /?. 

[0045] After the rotation, there are two, four-symbol groups ({Ao(k), A3(k)} and 
10 {Ai(k), Ai(k)}) that fulfill butterfly conditions exposed above. Thus, the butterfly 
structure can be used for all transitions and minimum computations in such a 
configuration as shown in Figure 7. 

[0046] Using a ti/8 counterclockwise rotation (any rotation of (2/c+l)*7t/8/ 
where k is an integer can be used), the butterfly HWA can be applied with the 
15 following values: 

For the first butterfly of symbol groups [Ao(k),A3(k)}: 
a x (s) = /3Re{h 0 *S} and a y (s) = alm{h 0 *S} 

smi = smoid + ||s|| 2 -2pRe{/z*S} 
sm 2 = sm 0 id + ||s|| 2 +2pRe{/z*S} 
20 m = 2alm{h*S) 

For the second butterfly of symbol groups {Ai(k),A2(k)} 
a x ( s ) = a Re{h 0 * S} and a y (s) = f3 lm{hj S} 

smi - smoid + ||S|| 2 -2aRe{/i*S} 
smi = smoid + ||^|| 2 +2aRe{/i*S} 
25 m = 2(3Im{/z*S} 
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with a - cos(;r /8) , J3 = sin(;r /8) and 

Re{h*S] - h x S x +h S ,and \m{ti S} = h x S y -h S x 



[0047] A pseudo-code implementation for a modified 8PSK 2-state RSSE 
method is provided below: 
5 Initialize (Rotation of y n , hk) 

Repeat L times { 

For each RSSE state (2 times) { 
Compute Sand | |S| | 2 . 
Compute /z*S Re{} and Im{}. 
10 For each butterfly (2 times) { 

Compute sm\ and smi 
Compute bm 

) 

} 

15 Perform 8 Butterflies. 

Update. } End repeat L times 
Traceback (TBD) 

[0048] The n/8 rotation method has some implications: (1) n/8 rotation of 
received samples yi , (2) tt/8 rotation of channel estimation hk and (3) only 
20 applicable when J k < and when the constellation coset have symbols of 

opposite signs, 

[0049] Using a butterfly structure in the RSSE method can significantly 
reduce the computing resources needed for RSSE calculations, while having no 
effect on the performance of the RSSE function. 

25 [0050] It should also be noted that the RSSE method described above could be 
used in either a loss-less or a loss variation. In the loss-less variation, both y 
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(received symbol sample vector) and reference constellation symbols are rotated 
by angle equal to (2fc+l)*7r/8 with k being a whole number. The angle may vary 
depending upon the number of symbols in the constellation. 

[0051] In the loss variation, both y (received symbol sample vector) and 
5 reference constellation symbols are rotated by angle equal to (2/c+l)*7i/ 8. Then, 
either the reference constellation or y is rotated by an angle a bit higher or a bit 
smaller than the first one. The error is proportional to the absolute difference of 
the rotation angle between y and the reference constellation symbols. 

[0052] Figure 8 illustrates a block diagram of a mobile communication device 
10 50 using a DSP 60 for communications support. Information is received over 
antenna 52, which is converted to symbols yk by digital baseband circuitry 54. 
The symbols are received by the DSP 56, which rotates the received symbols 
and/ or reference constellation symbols, to generate branch metrics using the 
hardware accelerators 58 in support of : RSSE processing as described above. A 
15 memory subsystem 60 and input/ output subsystem 62 support the processing 
core. 

[0053] An alternative way to the constellation rotation solution discussed 
above in connection with the computation of the four axis symbols (cosets Ao and 
A2 in Figure 5) is to consider these symbols in the form of a k = a 1 -e~ j7r/A . Thus, we 
20 get fl£(j) = <ii Refe-^V^and a$(s) = a' y \m{e' j7rlA h^ S) , where a[ and^ are 
coordinates of a diagonal symbol (in coset Ai and A3 in Figure 5). In these 
expressions, (±a x , ±a y ) properties for axis symbols are assured and therefore 
butterfly structures can be used. Real and Imaginary parts are modified by the 
multiplication of the e~ J;r/A coefficient. 

25 [0054] Although the Detailed Description of the invention has been directed 
to certain exemplary embodiments, various modifications of these embodiments, 
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as well as alternative embodiments, will be suggested to those skilled in the art. 
The invention encompasses any modifications or alternative embodiments that 
fall within the scope of the Claims. 
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